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Given a pure state vector \x) and a density matrix p, the function p(x\p) = (x\p\x) 
defines a probability density on the space of pure states parameterised by density 
matrices. The associated Fisher- Rao information measure is used to define a uni- 
tary invariant Riemannian metric on the space of density matrices. An alternative 
derivation of the metric, based on square-root density matrices and trace norms, 
is provided. This is applied to the problem of quantum-state estimation. In the 
simplest case of unitary parameter estimation, new higher-order corrections to the 
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^ , uncertainty relations, applicable to general mixed states, are derived. 
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The structure of the space T> of density matrices is of considerable interest in quantum 
information theory and related disciplines such as quantum tomography and quantum sta- 
tistical inference. While substantial progress has been made in unravelling the structure 
q-i of V [IrB], the subject remains difficult, partly because of the complicated structure of V, 

which is a manifold with a boundary given by degenerate density matrices. The purpose of 
CN| ■ the present paper is to reveal the existence of a remarkably simple dual metric structure over 
the space of pure and mixed states, and to apply this to obtain new higher-order corrections 
to Heisenberg uncertainty relations for general mixed states. 

The idea of the dual metric structure can be sketched as follows. If \x) denotes a nor- 
malised pure state vector and p a generic density matrix, then the expectation (x\p\x) defines 
a probability density function either on the space of pure states parameterised by density 
matrices, or on the space of density matrices parameterised by pure states. Hence, methods 
of information geometry can be employed to define a pair of Riemannian metrics, one 
for the pure state space and one for the space of density matrices. 

An alternative formulation is to use the trace norm and matrix algebra associated with the 
Hermitian square-root map p — > £ = yjp. It is shown that if £ is parameterised by a pure state 
so that £ = p, then the associated information metric is the Fubini-Study metric; whereas if 
£ is parameterised by a generic density matrix, then we recover the information metric on T>. 
Apart from these two extremes, however, there are many other cases of considerable interest. 
Typically, in a problem of quantum state estimation, the density matrix p(6) depends on one 
or several unknown parameters {8 1 } that are to be estimated. The Fisher-Rao information 
metric 4tr[<9j£(#)<9,'£(#)], where <9j = d/dd\ then determines the covariance lower bound of 
the parameter estimates. In the simplest case in which p(8) represents a one-parameter 
unitary curve in the space of density matrices (e.g., 9 represents time), the quantum Fisher- 
Rao metric coincides with the "skew information" defined by Wigner and Yanase By 



this device we are able to extend the results of [1Q|-|15| on quantum statistical inference. In 
particular, we derive: (a) a variance lower bound for general mixed states, which is sharper 
than the previously known ones that are expressed in terms of the skew information; (b) 
the quantum-mechanical analogue of the conditional variance representation in terms of 
the skew information measures; and (c) higher-order corrections to the uncertainty lower 
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bound for mixed states, whereby the concept of higher-order skew information measures are 
introduced. 

Consider a finite, n-dimensional quantum system. The space of pure states r is the space 
of rays through the origin of C(n), which is just the complex projective space equipped with 
the unitary invariant Fubini-Study metric. We let T> denote the space of density matrices on 
C(n). Every density matrix defines a positive scalar function (x\p\x) on r, and this, properly 
normalised with respect to the Fubini-Study volume element, yields a probability density on 
r. Since, by the real polarisation identity, a density matrix is uniquely determined by the 
totality of its expectation values, these probability distributions are parameterised by V in 
a one-to-one fashion. Hence we can use the Hellinger distances between these distributions 
and obtain an information metric on T> to investigate properties of T>, which are directly 
related with the physical significance of density matrices. The dual picture emerges from 
the converse construction: Given any \x) G r, one obtains a nonzero nonnegative function 
(x\p\x) on V, which defines, when properly normalised, a set of probability distributions on 
T>. This likewise yields an information metric on r. By considerations of symmetry and 
invariance, using the unitary structure of C n , one can perceive that this a priori new metric 
on r can only be the Fubini-Study metric. 

If one assumes the Schrodinger equations of motion \x{t)) = exp(— iHt)\x(0)) on r and 
the Heisenberg equations of motion p(t) = exp(—iHt)p(0) exp(iHt) on T>, then the following 
conclusions can be drawn. The evolution on r is isometric with respect to the Fubini- 
Study metric (known as Wigner's theorem), and, moreover, the evolution on T> is isometric 
with respect to the information metric. Thus, we have a complete reciprocity between the 
Schrodinger dynamics on r, equipped with the Fubini-Study metric, and the Heisenberg 
dynamics on £>, endowed with the information metric. 

The components of these information metrics, with respect to a given choice of coordinate 
system, can be calculated by use of the standard expression for the Fisher-Rao metric. Let 
us write {x M } M= i v .. )2 „_2 for the coordinates on r with volume element dv x , and {p a } a =i,...,n 2 -i 
for the coordinates on T> with volume element dV p . Likewise, we write p(x\p) oc (x\p\x) for 
the normalised density on r and P(p\x) oc (x\p\x) for the normalised density on T>. Then 
the information metric G a b on D is given by 

Similarly, the information (Fubini-Study) metric g pv on r is given by 

We therefore have two statistical manifolds, r with the metric g^ u , and T> with the metric 
Gab', each of these is represented by probability distributions on the other. 

The duality of information metrics leads to integral representations for observable ex- 
pectations. Let us consider trace-free observables A. Defining A(x) = (x\A\x) and 
A(p) = ti(pA), the dual formulae read 

A{p)= / A(x)p{x\p) dv x and A(x) = / A(p)P{p\x)dV p . (3) 
J r Jv 

The first formula is a special case of Gibbons' explicit formula for the trace of a product of 



two observables [161 ] . The second is proved using Gibbons' formula along with invariance 
arguments. 
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We remark that the space of density matrices T> is a manifold with a piecewise smooth 
boundary, the latter consisting of the degenerate density matrices, i.e. those with at least 
one zero eigenvalue. The structure of V is thus analogous to that of the unit n — 1 simplex, 
which is the space of probability distributions on a set of n points. Information geometry 
is indeed applicable, mutatis mutandis, to manifolds with boundary. Thus, defining the 
information metric on T> in the above described manner, one obtains a volume element with 
the necessary invariance properties to prove the dual integral formulae. In particular, the 
Heisenberg dynamics on T> defines a Killing field with respect to this metric, which leaves 
the set of degenerate matrices (i.e. the boundary) invariant, that is, the Killing field is 
tangent to the boundary. 

An alternative approach is to label density matrices by their Hermitian square roots. 
The significance of the square-root map in the context of classical probability is outlined in 
Rao's seminal paper jsj; also elaborated in jsj]. The key formulation here in the quantum 
context is to consider not arbitrary square root of the density matrix, but the Hermitian 
square root, which provides the natural extension of the classical theory. Thus, we write 
P = where £ is an arbitrary Hermitian matrix such that tr(£ 2 ) = 1. If p is a pure 
state so that p = £ = then the quantum Fisher- Rao information metric is given by 

g^ v = 4ti(d^d u ^,), where = d/dx^ 1 , which is just the Fubini-Study metric <^j. On the 
other hand, if p is a generic mixed state the quantum Fisher- Rao metric is G a b = 4tr(<9 a £<96£), 
where d a = d/dp a , and this is just the information metric ([T]). 

As an illustration, consider the 2x2 case. A generic Hermitian matrix £ can be expressed 
in the form 

i-dtlM'?)- (4) 



x + iy t — z 

For a pure state we write (x 1 ,^ 2 ) = (0,(f)), where t — |, x — Isin^cos^, y = | sin 6* sin 0, 
and z = | cos 9. Calculating g^ v = itr^d^d^), we obtain the usual spherical metric for the 
space of pure states. For a mixed state we write (p 1 , p 2 , p 3 ) = (u, 9, 4>), where t = (| — m 2 ) 1 / 2 , 

x = usm9 cos0, y = u sin 9 sin 0, and z = ucos9. Calculating G a b = 4tr(<9 a £<9(,£) we obtain 
the usual hyperbolic metric for a ball of radius | (interior of the Bloch sphere). 

The richness of the geometry of the space of density matrices, even in the 2x2 example, 
is elucidated further by the following analysis. We square (@J to obtain 

, _ ft 2 + x 2 + y 2 + z 2 + 2tz 2t(x-iy) 
P ~\ 2t(x + iy) t 2 + x 2 + y 2 + z 2 - 2tz 



(5) 



By construction p is positive; thus we need only consider the trace condition 



tr(p) = l t 2 + x 2 + y 2 + z 2 = \. (6) 

Therefore, under the Hermitian square-root map the space of 2 x 2 density matrices is 
mapped to a sphere S 3 C E 4 . Making use of the trace condition ([6]) we can write 



i + 2tz 2tx - 



■> 



c - i(2ty) \ 

1 o^^ 



2tx + i{2ty) \-2tz I 



(7) 
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where t is determined by (x,y,z) via (Q. Clearly, under the reflection (t,x,y,z) — > 
(—t, —x, —y, —z) we have p — > p. This, however, is not the only degeneracy. While a density 
matrix has a unique positive semidefinite square root £ and a unique negative semidefi- 
nite square root — £, there also exist square roots which are neither positive nor negative 
semidefinite. 

To work out the roots, suppose that a density matrix 

is given. How many points on S 3 give rise to the same g of (jSJ)? By comparing (j7|) and (jSJ) 
we obtain: 



2tx = b, 2ty = c, 2tz = a. (9) 

To find the intersection of (JBj) and ([2]) we eliminate (x, y, z) using (jUj) to obtain 4t 4 — 2t 2 +R 2 = 
0, where i? 2 = a 2 + 6 2 + c 2 (clearly, < R 2 < 4). This quartic equation admits in general 
four solutions: 



t=± V — ^ — ■ (10) 

We see, therefore, that the space of an equivalence class on S 3 is already quite nontrivial. 

There are three distinct cases: (i) R = (so that g is 'fully mixed', i.e. proportional to the 
identity); (ii) R — | (so that g is 'pure'); and (iii) < R < | (so that g is 'generic'). In case 

(i) , we have either t = ±l/v2 or t = (double root). For i = ±l/\/2 we have an antipodal 
pair on S* 3 , whose coordinates are (t, x, y, z) = (^, 0, 0, 0) and (— ^ 0, 0, 0), that give rise to 

the same density matrix; for t = we have an S 2 degrees of freedom given by x 2 +y 2 +z 2 = ~; 
all the points on this S* 2 determine the density matrix proportional to the identity. In case 

(ii) , we have t = ±|, and hence (t,x,y,z) = ±(~,b, c, a). The totality of pure states on 
S 3 is then given by a pair of two-spheres S 2 associated with t — |, x 2 + y 2 + z 2 = -r and 
t = — |, x 2 + y 2 + z 2 = j. In case (iii), the generic case, we have four distinct values for t 
given by f JTOj) . That is, two antipodal pairs on S* 3 give rise to the same density matrix (JSJ). 
Therefore, the space of mixed-state density matrix is given by the quotient of S 3 by the 
identification of two antipodal pairs. These configurations are schematically illustrated in 
figure [Q 

The utility of information geometry is not merely confined to the study of the structure of 
quantum state spaces; information geometry plays a crucial role in the theory of statistical 
estimation. In the quantum context, typically one has a system characterised by a density 
matrix p(6) that depends on a family of unknown parameters {0 1 }. To determine the state 
of the system one constructs a family of observables {€>*}, corresponding to the estimators of 
{#*}; by measuring these observables one estimates the unknown parameters. The degrees 
of error in these estimates are characterised by the covariance matrix tr[(G l — # l )(@ J — 
6 1 - 7 )/)(#)], where for simplicity we have assumed that the estimators {O 1 } are unbiased so 
that tr(G)*p(6 1 )) = 6 l . The lower bound of the covariance matrix is determined by the inverse 
of the quantum Fisher-Rao metric gry = Atr(diC,(9)dj^(6)) . In many applications the trace 
can be computed explicitly to determine the lower bounds for the estimates. 
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FIG. 1: (colour online) Manifold of 2x2 density matrices. The sphere S 3 of radius l/y/2 constitutes 
the covering space for the space of 2 x 2 density matrices. If we regard the i-axis as defining the 
'poles', then the density matrix g = |l corresponds to the 'north' and the 'south' poles, as well as 
the equator. Moving down from the north pole, at t = ±^ we have the latitudinal 'circles' S 2 given 
by x 2 + y 2 + z 2 = |, corresponding to the pure-state boundary. Crossing this boundary by further 
reducing the value of t we see a 'mirror image', where starting from pure state one approaches the 
fully-mixed state g = ^1 as t — > 0. The northern hemisphere of the S 3 is thus partitioned into two 
parts; the northern cap t > \ and the latitudinal band < t < \. Each point is identified with an 
antipodal point in the southern hemisphere, but in addition the points above and below t = \ are 
also identified in pairs; and finally, the equator is identified as a single point, also to be identified 
with the two poles. 

As a concrete example, let us work out the variance lower bound associated with the 
estimation of unitary (e.g., time) parameter. The problem is as follows. We have a system 
prepared in an initial state £o = £(0), which evolves under the Hamiltonian H so that 

it = exp(-iHt)i exp(iHt). (11) 

The task is to estimate how much time has elapsed since its initial preparation. Let T be an 
unbiased estimator for the time parameter t so that tr(T£ t £ t ) = t (strictly speaking, all we 
need is a locally unbiased estimator satisfying cVtr(T^£ t ) = 1) [17|. If we write T = T—tt for 
the mean-adjusted estimator, then the variance of T is AT 2 = tr(T 2 ^^). By differentiating 
the trace condition tr(£ t £ t ) = 1 in t and writing ^ = d t £ t , we obtain tr(£^) = 0. Hence by 
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differentiating tr(T£ t £t) = in t and using f' = — 1 we find 

tr[(T& + | t T)g] = l. 



(12) 



Recall that the matrix Schwarz inequality applied to a pair of Hermitian operators X and 
Y states that [tr(XY)] 2 < tr(X 2 )tr(Y 2 ). Squaring ([12]) and substituting X = + f t T 
and Y" = ^ in the Schwarz inequality, we deduce a version of the quantum Cramer-Rao 
inequality 



AT 2 + 8T 2 > 



1 



2tr(££) 



(13) 



where <5T 2 = tr(T^T^) — t 2 , and where the denominator on the right side is half the 
Fisher-Rao metric. By use of the unitary evolution = — i(H£t — £,tH) we find that 



tr(^#)-tr(ff&ff& 



(14) 



The right side of (I14p is twice the skew information introduced by Wigner and Yanase j9[. 
It is interesting that Fisher introduced the left side of (fill) , in the classical context, as 
the amount of extractable information concerning the value of t Iffy , whereas Wigner and 
Yanase introduced the right side of (I14p . in the quantum context, as an a priori measure of 
information contained in the state concerning "not easily measured quantities" (such as T) 
[§]. In special cases for which & is a pure state satisfying £ 2 = £ t , the skew information is 
maximised and (114j) coincides with twice the energy variance 2 AH 2 , while 5T 2 = 0, and we 
recover the uncertainty relation for the unitary parameter estimation as in 13]. For mixed 
states, however, we obtain 



AT 2 + 8T 2 > 



A(AH 2 -5H 2 ) 



(15) 



We note that for an arbitrary observable H the Wigner- Yanase skew information I p is 
defined by the expression 

(16) 



I p {H) = tl(H 2 p) - \x{EyJpEyJp\ 

whereas the quantity SH introduced above, which might appropriately be called the skew 
information of the second kind, is given by 



5H 2 = tT(Hy/pHy/p) - tr(Hp) 



(17) 



It follows that the total variance takes the form AH 2 = I p (H) + SH 2 , which can be inter- 
preted as the quantum analogue of the conditional variance formula (that the total variance 
is the sum of the expectation of the conditional variance and the variance of the conditional 
expectation). In particular, it is easily shown that AT 2 > 5T 2 > 0. By taking the upper 
bound for 5T 2 , one obtains AT 2 > l/4tr(£^), which is the uncertainty relation based on 
the skew information obtained by Luo [14J (see also 19]). This bound is not tight because 
the Luo relation gives AT 2 AH 2 > 1/8 for pure states, whereas f|T3l) gives AT 2 AH 2 > 1/4. 
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If we combine (II 5p with its dual inequality AH 2 + SH 2 > 1/4(AT 2 — 5T 2 ), we obtain a less 
tight but more symmetric expression 

AT 2 AH 2 > \ + 5T 2 5H 2 , (18) 

which implicitly appears in [20| . 

By exploiting the utility of information geometry, we can go beyond the lowest-order 
term in uncertainty relations. To this end, we remark that the Fisher-Rao metric that 
determines the uncertainty lower bound is just the squared 'velocity' of the evolution of 
quantum states. In the case of a pure state, the fact that squared velocity of the state 
evolution is given by the energy uncertainty is known as the Anandan-Aharonov relation 



21[. The observation of Luo can be paraphrased by saying that this velocity, in the case of 
a generic mixed state, is given by the skew information. In addition to the squared velocity, 
we consider the squared 'acceleration' a t = — tr(£"£ t )£ t , which determines the curvature 
7 2 = 16tr(dt6;t)/tr(£^) of the curve in the space of square-root density matrices. The 
squared acceleration determines the next-order correction to the uncertainty relation. More 
generally, we can obtain arbitrarily many higher-order corrections by considering higher- 
order derivative terms, thus generalising the quantum Bhattacharrya bounds derived in 
0, B for pure states to general density matrices. 



To illustrate how higher-order corrections to uncertainty relations can be obtained sys- 
tematically, let us examine the second-order and the third-order corrections here. For this 
purpose, it will be convenient to relax the unit trace condition trp = 1 but merely require 
trp 7^ and trp < oo, and impose unit trace at the end of the calculation. We then define the 
symmetric function = tr(£T£)/tr(££) of £ and consider the gradient V£ of this function 
in the space of square-root density matrices, evaluated at tr(££) = 1. A short calculation 
then shows that the squared magnitude (in the Hilbert-Schmidt norm) of the gradient is 
| Vt| 2 = 2 (AT 2 + 5T 2 ). On the other hand, the squared magnitude of the gradient is clearly 
larger than (or equal to) the squared magnitude of its component in the direction of and 
thus AT 2 + 5T 2 > l/2tr(«^), where we have made use of the relation i[H, T] = 1. This 
provides an alternative derivation of the quantum Cramer-Rao inequality. 

The next order correction to the uncertainty lower bound for mixed states is obtained from 
the component of Vt in the direction of a t , given by tr((T^ + £ t T)at) / tr(a t a t ) . Hence if the 
curvature 7 of the curve £t is large, the correction is small. The numerator, however, in gen- 
eral depends on T. To seek a bound that is independent of T we thus consider the third-order 
term, i.e. the component of the gradient in the direction of (3 t = — (tr(£"^)/tr(£^))£p 
This is given by tr((T^ + t; t T) (3 t ) / tr(f3 t f3 t ) . Although the calculation of this term is straight- 
forward, the resulting expression is lengthy and will be omitted here. It suffices to note that 
each of the odd-order corrections involves commutators between T and H k for some k, and 
thus is independent of T; whereas all even-order corrections are in general dependent on 
T. Furthermore, terms appearing in the corrections have specific statistical interpretations 
in terms of the quantum extensions of central moments of the Hamiltonian. In the pure 
state limit where £| = t; t , these expressions reduce to the "classical" central moments, in a 
manner analogous to the reduction of the Wigner-Yanase skew information to the second 
central moment AH 2 . We are thus able to obtain various higher-order generalisations of 
the skew information, which appear to be entirely new quantities in the study of statistical 
properties of mixed states in quantum mechanics. These quantities might appropriately be 
called quantum skew moments. 
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